B3 

0 Publieation number: 0 243 153 

A2 



® EUROPEAN PATENT APPLICATION 

® Application number: 87303509.1 ® Inta*: C 12 P 21/02 

- C 12 N 15/00 

@ Date of filing: 22.04.87 



@ Priority: 22.04.86 US 856643 


@ Inventor: Cosmam David John 


14.11.86 US 931468 


11611thAvanueEastNo.301 




Seattle Washington 98102(US) 


@ Date of publication of appHcation: 


@ inventor: GliUs, Stephen 


26.10.87 Bulletin 87/44 


3455 West MercerWay 


(&) Designated Contracting States: 


Mercer Island Washington 98040(US) 


@ Inventor: MOchlzuld, Diane YuMko 


AT BE CH DE ES FR GB 6R IT U LU NL SE 


@ Appitcant: IMMUNEX CORPORATION 


4805 Stanford Avenue Northeast 


Seattle Washington 98105(US) 


51 University Street Suite 600 600 Immunex Building 


@ Inventor: March, Carl Jacic 


Seattle Washington 98101 (US) 


8133 8th Southwest 


@ inventor: Cerretti, Douglas Pat 


Seattle Washington 98106(US) 


1607 North 197th Place 


(2) Inventor: Price, Virginia Lee 


Seattle Washington 98133(US) 




2617 Boyer Avenue East 




Seettle Washington 98102(US) 




@ Inventor: Tushlnskl, Robert J. 




2024 Northwest 62nd Street 




Seattle Washlngton98107(US) 




® Inventor: Urdal, David Uoyd 




6826 55th Avenue Northeast 




Seattle Washington 98115(US) 




(74) Representative: Sheard, Andrew Gregory at al. 




KHburn & Strode 30, John Street 




London WC1N 2DD(GB) 



@ Human G-CSF protein expression. 

(S) Human granulocyte colony stimulating factor (hG-CSF) 
and related mutant cDNAs and proteins are disclosed, 
characterized by molecular modifications Increasing ex- 
pression in microbial systems. Related aspects concern 
recombinant yeast and bacterial expression vectors for 
producing the proteins in high yield, and processes for 
making the proteins using microbial expression systems. 
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TITLE 

Human 6-CSF Protein Expression 

BACKGROUND OP THE INVENTION 
The present invention relates generally to lymphoklnes, and 
particularly to growth factors which induce differentiation and 
proliferation of human hematopoietic progenitor cells. 

Human blood cells derive from pluripotent hematopoietic stem 
cells originating in the bone marrow. These stem cells provide 
progenitor cells for erythrocytes, polymorphonuclear granulocytes, 
monocytes, B and T lymphocytes and platelets. Polymorphonuclear 
granulocytes, characterized by lobulated nuclei and abundant 
pseudoplasmic granules, differentiate into neutrophils, eosinophils 
and basophils. Monocytes and lymphocytes differentiate along 
particular pathways to transform into macrophages which are 
indistinguishable from typical tissue macrophages. 

The differentiation and proliferation of hematopoietic cells 
is regulated by secreted glycoproteins collectively referred to as 
colony stimulating factors (CSFs). In murine and human systems, these 
proteins include granulocyte-macrophage colony stimulating factor 
(GH-CSF), which promotes granulocyte and macrophage production from 
normal bone marrow., and which also appears to regulate the activity of 
mature, differentiated granulocytes and macrophages. Other CSFs 
Include macrophage CSF (M-CSF or CSF-1), which induces the selective 
proliferation of macrophages, and burst promoting activity (EPA), 
which induces development of erythroid cell progenitors into 
hemoglobin-containing cells. An additional murine CSF, known 
variously as IL-3 or multi-CSF, stimulates the development of numerous 
cell types of hematopoietic lineage. Gibbon and human homologues of 
murine IL-3 have been reported by Yang et al., Cell 47:3 (1986). 

Granulocyte-specific colony stimulating factors (G-CSFs) 
induce differentiation and expansion of granulocyte-committed 
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progenitor cells from multipotent hematopoietic stem cells. When 
administered to mammals, G-CSFs promote a dramatic Increase in 
circulating granulocyte populations. 

Murine and human granulocyte colony stimulating factors have 
been isolated and partially characterized; Murine 6-CSF vas first 
purified from endo toxin-stimulated mouse lung conditioned medium, .as 
described by Nicola et al«, J> Biol. Chem. 258:9017 (1983). The 
murine protein induced terminal differentiation of the VEBI-3B murine 
myelomonocytic leukemia cell line, and stimulated neutrophilic 
granulocyte colonies in semisolid agar. At lov concentrations, murine 
6-CSF stimulated granulocyte colony precursors exclusively; at higher 
concentrations, it stimulated mixed granulocyte/macrophage colony 
formation. Isolation and sequencing of a cMlA encoding murine G-CSF 
vas reported by Tsuchiya et al., Proc. Matl, Acad. Sci> USA 83:7633 
(1986). 

Velte et al.. Proc- Natl. Acad, Sci. USA 82:1526 (1983) 
reported partial purification and preliminary characterization of a 
CSF found in supematants of the human bladder carcinoma cell line BBT 
5637. This factor, termed "pluripotent hematopoietic colony- 
stimulating factor", supported growth of human mixed colonies, 
granulocyte-macrophage colonies, and early erythroid colonies. This 
CSF also induced terminal differentiation of the VEHI-3B cell line. 
An apparent molecular veight of about 18,000 vas reported. 

Nomura et al., EMBO Journal 5:871 (1986), established a human 
oral cavity carcinoma cell line (CHU-2), vhlch constitutively produced 
a granulocyte-lineage specific CSF. This 19,000 dalton protein 
exhibited an assay profile largely coincident vith that of murine 
G--CSF. Nagata et al., Nature 319:415 (1986) isolated tvo dMlAs for 
the CHU-2 G-CSF protein vhich differed vith respect to a three amino 
acid insert. When expressed in mammalian COS cells, both cDNAs 
provided proteins vith G-CSF activity. Sequencing of the cDNAs and a 
genomic G-CSF clone suggested that the sequence containing the insert 
vas the product of an alternative mRNA splice. The protein lacking 
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the Insert had greater biological activity. 

Sotiza ct al., Science 232:61 (1986) purified G-CSF from HBT 
5637 cell supernatants, and isolated a cDNA from an HBT 5637 library 
which had a sequence homologous to the CHU-2 G-CSF clone lacking the 
three codon insert. Souza et al. cloned and expressed the HBT 5637 
G-CSF sequence in Escherichia coli to provide active protein. 

Due to potential clinical utility as a stimulator of 
granulocytic cell precursors, there is considerable interest in G-(S? 
in the hematology and oncology communities. Therapeutic compositions ... 
vith 6-CSF activity could be employed to potentiate immune 
responsiveness to infectious pathogens, or to assist in reconstituting 
normal blood cell populations following radiation- or 
chemotherapy-induced hematopoietic cell suppression. G-CSF may also 
find application in the treatment of certain leukemias, due to its 
ability to cause differentiation of certain neoplastic cells of 
hematopoietic lineage. The present invention concerns novel mutant 
G-CSF proteins which are expressed and secreted in high yields by 
yeast cells. 

SUMMARY OF THE INVENTION 
The present invention provides purified human granulocyte 
colony stimulating factor (hG-CSF) as a homogeneous protein, processes 
for its production from cell cultures, and cDNA's which encode hG-CSF. 
In related aspects, the present invention is directed to human G-CSF . 
(hG-CSF) muteins having A/T enriched 3' coding regions and/or 
inactivated yeast KEX2 protease processing sites. These mutant 
proteins are expressed in very high yield in yeast and bacterial 
expression systems. The invention also concerns recombinant expression 
vectors comprising nucleotide sequences encoding the muteins, related 
microbial expression systems comprising appropriate host organisms 
transformed with the foregoing recombinant expression vectors, and 
processes for making the muteins using the microbial expression 
systems. 
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BRIEF DESCBIPTIOH OF THE mVINGS 

FIG. 1 compares the nucleotide sequences and corresponding 

22 

amino acid sequences of native human G-CSF and hG-CSF[Lys ]i a 
preferred mutein having an inactivated KE'X2 site and other nucleotide 
changes resulting in enhanced expression. FIG. I also indicates the 
position of preferred locations for elimination or addition of 
cysteine residues to stabilize expressed protein. 

FIG. 2 is a schematic representation of the construction of 
plasmid pBC102-K22, which comprises a cIWA sequence encoding 
hG-CSFILys^^l. This plasmid, resident in E. coll strain RRl, has 
been deposited with the American Type Culture Collection (ATCC) under 
accession number 67»255« 



DETAILS OF THE INVEOTICMI 
Crude preparations of hG-CSF are prepared by initially 
culturing malignant, neoplastic hematopoietic growth factor producing 
cells in vitro in a semm containing medium supplemented with various 
additives. Conditioned medium is prepared from the cells by first 
growing the cells to confluence and then removing the medium from the 
cells for replacement with fresh culture medium containing serum. 
After an appropriate culture duration, the medium is harvested and 
processed to purify the hematopoietic growth factor into a more 
concentrated form. 

Various cells and cell lines can be employed in the 
production of crude hematopoietic growth factor, including various 
malignant neoplastic cell lines, for example carclnooia, sarcoma, 
lymphoma, leukemia, melanoma and myeloma cell lines. An exemplary 
cell line is the human bladder carcinoma cell line HBT 5637, which is 
publicly available from the American Type Culture Collection, 12301 
Farklawn Drive, Rockville, MD 20852, USA, under accession number 
HTB-9. 
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Suitable culture media for production of crude hG-CSF include 
such coDunercially available media as Roswell Park Memorial Institute 
(RPHI)-1600 medium, Dulbecco's Modified Eagle Medium (DHEH) and 
Click's Medium. Additives such as the antibiotics penicillin, 
5 streptomycin, and gentamycin can be added- to media. Additional 
additives may include various buffers, such as HEPES buffer 
(N-2-hydroxyethylpiperazine-N-2-ethane-sulfonic acid). Further 
additives may include L-glutamine, NaHCO^, 2-fflercaptoethanol and 
various serums, Including fetal calf serum (PCS) or normal human 
LO serum. Preferably, cultures are maintained at 35-38"C in a humidified 
atmosphere of 5-101 CO^ in air, at pH 7.0-7.4. 

Various assays can be employed to detect granulocyte-speciflc 
hematopoietic growth factors. A first assay, referred to herein as a 
"bone marrow assay" measures capacity of a sample to induce 
X5 proliferation of bone marrow cells obtained from both human or murine 
sources. Cells are stained to visualize the cell types present in 
induced colonies. hG-CSF induces formation of mature granulocytes and 
neutrophils from both human and murine bone marrow. 

A second assay measures capacity of a sample to cause 
20 proliferation of IL-3 dependent cells such as 32D cells, FDC-P2 cells, 
FDC-Pl cells and FDC-P2-ld cells, all of which are widely available. 
32D cells are derived from a retrovirus-infected C3H-BeJ mouse bone 
marrow culture, as described by Greenburger, et al.. Fed. Proc. 
42:2762 (1983). hC-CSP scores positively in IL-3 dependent cell 
25 proliferation assays, indicating that the activity of the factor is 
not restricted to simply granulocytes or macrophages. 

A third assay for granulocyte-specif ic CSF involves measuring 
the capacity of a sample to cause leukemic cells to differentiate into 
mature leukocytes. Exemplary cell lines for this purpose include the 
30 murine myelomonocytic WEHI-3 (ATCC TIB-68) cells and the human 

myelocytic leukemia cell line, HL60, from the human erythroleukemia, 
KGl. This particular assay procedure of the present invention has 
been conducted with WEBI-3B D+ cells. Nicola and Hetcalf , J- Cell. 
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p^^ys^ 109:253 (1981) describe use of WEHI-3B D+ cells to detect the 

presence of G-CSF. 

Crude hG-CSF produced in supernatants of HBT 5637 or other 
suitable cells is Initially purified by anmonium sulfate 
5 precipitation, followed by gel coluan chromatography and then by 
multiple HPLC procedures. The progress of purification can be 
monitored by gel electrophoresis followed by silver staining. 

Preferred gel filtration media Include agarose, Sephadex, 
polyacrylamlde, (e.g., Bio^el). or mixtures of the foregoing such as 
10 AcA-44 ultragel or AcA-54 ultragel. AcA.54 ultragel is most 

preferred. hG-CSF derived from crude culture supernatants eluted from 
gel filtration columns In a molecular weight range from 15,000-30,000 
daltons. 

Gel filtration fractions containing hG-CSF activity are 
15 pooled for preparative HPLC procedures, preferably employing a reverse 
phase, octadecyl bonded silica column having a pore size of at least 
15-20 microns. Such columns include the Radlopak and Porasll lines of 
columns commercially available from Waters Associates (Milford, HE, 
USA). Prior to applying the hematopoietic growth factor to the 
20 column, fractions are diluted and the column equilibrated with a 

buffer comprising trlfluoroacetlc acid (TFA), heptafluorobutyrlc acid 
(HFBA) or acetic acid. hG-CSF can be eluted from RP-HPLC columns 
using a linear gradient of acetonitrile or an N-propanol buffer 
solution m TFA, HFBA or acetic add. If acetonitrile is used as 
25 eluant, a preferred gradient is 0-100% (V/V) of acetonitrile in TFA, 
applied to the column at a rate of approximately IX acetonitrile per 
minute. If the eluant is H-propanol, a preferred gradient is 60Z 
(V/V) n-propanol in 0.9 H acetic acid and 0.2 H pyridine (pH 4.0). A 
preferred gradient range for this buffer eluant is from 0-lOOZ and, 
30 more preferably, from 0-84X. 

If sufficient protein purification is not achieved by the 
initial HPLC procedure, it can be repeated, using similar or different 
elution buffers. As detailed below, a single HPLC step using C14 
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bonded phase and elution with aqueous TPA did not sufficiently purify 
hG-CSF from cell media to provide a single specific protein, band by 
gel electophoresis. However, after a second HPLC treatment employing 
the same column but with an N-propanol/acetic acid/pyridine eluant, a 
5 distinct band at about 19,000 daltons was recovered which corresponded 
to hematopoietic growth factor activity. 

To achieve homogeneity, a third HPLC step was undertaken, 
employing a C18 column equilibrated with TFA, which was eluted with an 
acetonitrile gradient. Following this step, homogeneous hematopoietic 
10 growth factor activity was observed in a single fraction having ai 

molecular weight of approximately 19,000 daltons. The hematopoietic 
growth factor (hG-CSF) was found to have a specific activity of about 
10^ units per microgram (U/pg) in the 32D cell line proliferation 
assay and 10^ colony forming units per microgram (CFU/yg) in the 
15 murine bone marrow colony forming assay. 

The hematopoietic growth factor activity was found to be 
stable for at least six months when stored at in organic buffers 
used in HPLC fractionation, supra , (TFA/acetonitrile) or 
(pyridine/acetate/propanol) • 
20 Samples of the homogeneous protein were analyzed for amino 

acid composition and sequence using an automated sequencer employing 
either ninhydrin or gas phase detection. The first 25 residues of the 
amino terminal portion of the hematopoietic growth factor of the 
present invention are composed of the following sequence: 
25 Thr^Pro-Leu-Gly-Pro-Ala-Ser-Ser-Leu-Pro-Gln-Ser-Phe-His- 
Cys-Lys-Asn-Leu-Glu-Gln-Val-Arg-Ile-Lys-Ile. 

A DNA segment encoding native human G-CSF was isolated from a 
cDNA library prepared by reverse transcription of polyadenylated RNA 
isolated from a human bladder carcinoma cell line, HBT 5637 (ATCC 
30 HBT-9). Synthetic oligonucleotide probes having sequence homology to 
N- terminal, C- terminal, and 3' non-coding regions of the native human 
mRNA sequence were employed to screen the library by conventional DNA 
hybridization techniques. DNA was isolated from those clones which 
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hybridized to the probes end analyzed by restriction endonudease 
cleavage, agarose gel electrophoresis, and additional hybridization 
experiments ("Southern blots") involving the electrophoresed 
fragments. After isoUting a single clone which hybridized to each of 
the probes, the hybridizing segment bearing the hG-CSF gene was then 
subdoned and sequenced by conventional techniques. A part of this 
fragment, encoding amino acids 43-174 of mature h6-CSF, was identified 
and cleaved from the remainder of the fragment with a particular 
restriction endonudease. This DMA segment was ligated to a synthetic 
N-terminal sequence comprising a 3 codon deletion and added internal 
restriction sites. This construct was then altered by site-specific 
mutagenesis procedures to provide a mutant analog sequence in which 
the codon specifying Arg^^ (AGA) was replaced by a codon specifying 
Lys (AAA). 

The mutant or dtered sequence was then inserted into a yeast 
expression vector under control of a particular promoter. The vector 
was used to transform an appropriate yeast expression strain, which 
vas grown in culture under conditions promoting derepression of the 
yeast promoter. The resulting yeast-conditioned culture^supernatant 
was assayed to confirm enhanced expression of hG-CSFlLys ]. 

Definitions 

"Human granulocyte colony stimulating factor" and "hG-CSF" 
refer to a human endogenous secretory protein which selectively 
Induces development of granulocyte-commltted progenitors from 
multlpotent hematopoietic cells. 

"Mutant amino add sequence" refers to a polypeptide encoded 
by a nucleotide sequence Intentionally made variant from a native 
sequence. "Mutant protein" or "outein" means a protein comprising a 
mutant amino acid sequence. "Substantially homologous," which can 
refer both to nucleic add and amino acid sequences, means that a 
particular subject sequence, for example, a mutant sequence, varies 
from a reference sequence by one or more substitutions, deletions, or 
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additions, the net effect of which do not result in an adverse 
functional dissimilarity between reference and subject sequences. For 
purposes of the present invention, sequences having greater than 80 
percent homology* equivalent biological specific activity, and 
equivalent expression characteristics are -considered substantially 
homologous. Sequences having lesser degrees of homology, comparable 
bioactivity, and equivalent expression characteristics are considered 
equivalents. "Native sequence** refers to an amino acid or nucleic 
acid sequence which is identical to a wild-type or native form of a 
gene or protein. **K£X2 protease recognition site** is defined below. 
The terra "inactivate", as used in defining the present invention, 
means to alter a selected KEX2 protease recognition site to retard or 
prevent cleavage by the KEX2 protease of Saccharomyces cerevisiae . 

"Recombinant," as used herein, means that a protein is 
derived from recombinant microbial (e.g., bacterial or fungal) 
expression systems. "Crude culture supernatant" refers to media 
withdrawn from yeast or bacterial cultures which has not been 
subjected to concentration or purification procedures. 

"DNA segment" refers to a DNA polymer, in the form of a 
separate fragment or as a component of a larger DNA construct, which 
has been derived from DNA isolated at least once in substantially pure 
form, i.e., in a quantity or concentration enabling identification, 
manipulation, and recovery of the segment and its component nucleotide 
sequences by standard biochemical methods, for example, using a 
cloning vector. "Nucleotide sequence" refers to a heteropolymer of 
deoxyribonucleo tides. "Recombinant expression vector" refers to a 
plasmid comprising a transcriptional unit comprising an assembly of 
(1) a genetic element or elements having a regulatory role in gene 
expression, for example, promoters or enhancers, and (2) a structural 
or coding sequence which is transcribed into mRNA and translated into 
protein. Preferably, the transcriptional unit includes a leader 
sequence enabling extracellular secretion of translated protein by a 
host cell. "Recombinant expression system" means a combination of an 
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expression vector and a suitable host microorganism. ^'Substantially 

pure'' means at least 75%, 80% , 90%, 95% or 99% by weight pure in 
increasing order of preference. 

Assays for hG-CSF Biological Activity 
Assays used to measure hG-CSF biological activity are 
described belov. 

1. Human and Murine Bone Harrov Colony Assays 

In this assayi 50 vl samples are plated in appropriate veils 
in log-2 dilution series. A l.AX agar suspension is prepared by 
heating in a boiling water bath, and then held at 40*C prior to use. 
An incubation medium is prepared by mixing seven parts nutrient medium 
[a-minimum essential medium (odlEM) supplemented with vitamins, 28.5% 
fetal bovine serum (FBS), 0.7 x 10* M 2-mercaptoethanol, 0.12 mg/ml 
asparagine, 0.7 mg/ol glutamine, 150 0/ml penicillin G, and 150 U/ml 
streptomycin] and three parts agar suspension, and held at 37®C. 
Percoll treated bone marrow cells are warmed to 37 and added to the 
incubation medium to provide a final concentration of approximately 1 
X 10^ cells/ml. The resulting mixture Is kept at 37"C while 
dispensing 250 vl aliquots into each well. Plates are held at about 
23 ®C until the agar solidifies, then incubated at 37*C in plastic 
boxes containing distilled water to prevent desiccation. 

Colonies having 50 or more cells each are counted on days 7 
or 10 and 14. Earlier counts are better for granulocyte colonies, 
while later counts are better for macrophage and mixed colonies. In 
each assay, several wells are plated without hematopoietic growth 
factor samples to obtain a background colony count. hG-CSF activity, 
expressed in colony forming units per milliliter ("CFU/ml"), is 
defined as that sample dilution providing one-half of the maximum 
colonies formed by 1 x 10^ bone marrow cells, multiplied by the number 
of colonies observed in the half maximal case. 

Cell types in colonies are determined by staining individual 
cells with a stain consisting of 0-6X Orcein and 60% acetic acid. 
hG-CSF samples can also be assayed using murine bone marrow cells in a 
protocol substantially similar to that described above. In the murine 
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assay, horse serum is used in place o£ FBSt marrow cells are plated at . 
a concentration of 5 x 10^ cells per ml incubation medium, and 
colonies are counted on days 4 or 5 and 7. 
2. WEHI Differentiation Assay 

This assay determines the capacity of a sample to induce 
terminal differentiation of a murine myelomonocytlc leukemic cell 
line, VEHI-3B D+, This cell line has been widely distributed among 
researchers and is available from numerous sources. Including Immunex 
Corporation, 51 University Street, Seattle, Vashington, 9B101. In the 
presence of phorbol myristate acetate (PMA), granulocytes resulting 
from hG-CSF"lnduced differentiation of WEHI-3B D-^ cells convert 
nitro-blue tetrazollura (NET) intracellularly from yellow to blue. 

VEHI-3B D4> cells are maintained in culture in a-M£M 
supplemented with 15% FBS, 2 x 10"^ M 2-mercaptoethanol, 50 U/ml 
penicillin, 50 pg/ml streptomycin and 300 yg/ml fresh L-glutamlne 

complete a-MEM"), and are subcultured every 3-A days. The cells are 
counted and seeded into tissue culture flasks in 25-50 ml of medium at 
a density of approximately 1 x 10^ cells/ml. The cells double in 15X 
FBS every 1-2 days. 

In the assay, SO pg of complete a-HEH are dispensed into each 
well of a 96-well plate. Duplicate twofold serial dilutions of each 
sample are prepared, and 60 yl of a VEHI-3B D-i- cell suspension (1 x 
10^ cells/ml) is dispersed into each well. Plates are incubated at 
37'C in the presence of 7X Oj, 10% COj at high humidity. After 72 
hours, 10 \il of a fresh NBT-PMA solution (2 mg/ml NBT and 40 gg/ml PMA 
in PBS) is dispensed into the first row of the duplicate rows and 100 
yl of NBT (2 mg/ml in FBS) is dispensed into the second row* After an 
additional 1-2 hours at 37^C, 25 ul of 40 mM HCl in isopropanol is 
added to each well. Absorbance at 570 nH is then determined. Dye 
conversion is proportional to the number of granulocytes present, 
which, in turn, is proportional to hG-CSF activity. Units of activity 
in the sample are equal to the reciprocal of the dilution providing 
one-half maximum NBT conversion. 
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3> 32D Cell Proliferation Assay 

This assay ascertains the capacity of a sample to induce 
proliferation of 32D cells, vhich are derived from retrovirus-infected 
C3B-HeJ mouse bone marrov cells. 32D cells have been widely 
distributed among researchers and are commonly available from many 
sources, including Immunex Corporation. 

32D cells are routinely cultured in RPHI-1640 containing 5% 
(v/v) yEHI-3 conditioned medium, prepared from a 48 hour culture of 
WEHI-3 cells (1-5 x 10^ cells/ml) in RPMI-1640 medium supplemented 
with horse serum (5-20X, v/v), penicillin (50 U/ml), streptomycin (50 
yg/ml)f fresh L-glutaraine (300 ug/ml) and 2-mercaptoethanol (25 x 10^ 
H). Cells are subcultured every 2-3 days. Cells are counted and 

4 

seeded into culture flasks in 25-50 ml of medium at about 1-4 x 10 

cells/ml. The cells double every 12-24 hours at a relatively constant 

growth rate, and are maintained at 35-38^C in a humidified atmosphere 

of 5X CO- in air. To assay hG-CSF activity, 32D cells are plated in 

4 

96-vell flat bottom microtiter plates (2 x 10 cells/veil) in a volume 
of 200 yl, together vith a twofold dilution series of samples to be 
tested, and then incubated for 24-48 hours at 37®C in a humidified 

atmosphere of 5X CO^ in air. Cells are pulsed for approximately 16 

3 

hours with 0.5 uCi of tritiated thymidine ("^H-Tdr, New England 
Nuclear, Boston, MA, 20 Ci/mM specific activity), harvested on glass 
fiber filter strips, and then counted by liquid scintillation 
counting. 32D cells incubated with hG-CSF incorporate radiolabel in a 
dose-dependent manner. 32D cells cultured in the absence of 
hematopoietic growth factors incorporate only background levels of 
^H-Tdr. Units of activity are calculated as the reciprocal dilution 
of a sample providing 50% maximal thymidine incorporation. 

The hG-CSF Native Sequence 
The nucleotide and deduced amino acid sequences of a DNA 
segment containing the hG-CSF gene isolated as described below is set 
forth in FIG. 1. In FIG. 1, nucleotides are numbered beginning with 
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the ACC codon corresponding to the N- terminal threonine 

of the mature native protein. Similarly, amino acids are numbered 

from this threonine residue. The native polypeptide includes a leader . , 

sequence \*ich is cleaved upon secretion to provide mature protein. 

The native protein includes an Arg-Lys pairing at position 22, which 

is susceptible to cleavage by the KEX2 protease of S. cerevlsiae. 

In accordance with the present invention, site-specific V; c 
mutagenesis procedures are employed to inactivate KEX2 protease jt;;.^^ 
processing sites by deleting, adding, or substituting residues to 
alter Arg-Arg, Arg-Lys, or Lys-Arg pairs to eliminate the occurrence 
of these adjacent basic residues. Lys-Lys pairings are considerably 
less susceptible to KEX2 cleavage, and conversion of Arg-Lys or 
Lys-Arg to Lys-Lys represents a conservative and preferred approach to 
inactivating KEX2 sites. The resulting muteins are less susceptible 
to cleavage by the KEX2 protease at locations other than the yeast 
a-f actor leader sequence, where cleavage upon secretion is intended. 

The native hG-CSF sequence also includes an Arg-Arg pairing 
at residues 146-147. Muteins wherein this KEX2 site is inactivated 
are also within the scope of the present invention. 

PIG. 1 also depicts a synthetic sequence of nucleotides 1-145 
which was spliced to a C-terminal fragment of the native hG-CSF gene 
to provide a preferred mutant nucleotide sequence. This synthetic 
sequence varies from the native sequence by an "A/T enriched" 
N-terminal region wherein the proportion of deoxyadenosine and 
deoxythymidine residues is increased from 36% to 57%, promoting 
efficient expression in E. coll . In one embodiment of the present 
invention, this mutant sequence is linked to a yeast a-factor leader 
sequence, optionally via an N-terminal fusion construct comprising a 
nucleotide encoding the peptide Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys 
(DYKDDDDK). The latter sequence is highly antigenic and provides an 
epitope reversibly bound by specific monoclonal antibody. This 
sequence is also specifically cleaved by bovine mucosal enterokinase 
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at the residue Immediately following the Asp-Lys pairing. Fusion 
proteins capped with this peptide are also resistant to intracellular 
degradation prior to secretion* 

5 Construction of Analog Sequences and Muteins 

The characteristic feature of the mutant proteins and 
nucleotide sequences of this invention is the conversion of the native 
Arg-Lys pairing at residues 22 and 23 to a configuration less 
susceptible to KEX2 protease cleavage. However, numerous DNA 
10 constructions including all or part of the nucleotide sequences 

depicted in FIG. 1, in conjunction with oligonucleotide cassettes , 
comprising additional useful restriction sites, can be prepared as a 
matter of convenience. Expression vectors and systems comprising such 
constructions, as well as hG-CSF muteins produced by such systems, are 
15 within the scope of the present invention. 

Mutations can be introduced at particular loci by 
synthesizing oligonucleotides containing a mutant sequence, flanked by 
restriction sites enabling ligation to fragments of the native 
sequence. Following ligation, the resulting reconstructed sequence 
20 encodes a mutein having the desired amino acid insertion, 
substitution, or deletion. 

Alternatively, oligonudeotide-directed site-specific 
mutagenesis procedures can be employed to provide an altered gene 
having particular codons altered according to the substitution, 
25 deletion, or insertion required. Valder et al.. Gene A2:133 (1986); 
Bauer et al., Gene 37:73 (1985); Craik, Bio techniques, January 1985, 
12--19; Smith et al.. Genetic Engineering: Principles and Methods 
(Plenum Press, 1981); and U. S. Patent 4,518,584 disclose suitable 
techniques, and are incorporated by reference herein. 
3Q For either approach, conventional techniques for 

oligonucleotide synthesis are suitable, for example, the triester 
synthesis procedures disclosed by Sood et al., Nud. Acid Res. 4:2557 
(1977) and Hirose et al., Tet. Lett. 28:2449 (1978). 
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In site-specific mutagenesis, a strand of the gene to be 
altered is cloned into an H13 single-stranded phage or other 
appropriate vector to provide single-stranded (ss) DNA comprising 
either the sense or antisense strand corresponding to the gene to be 
altered. This DNA is then hybridized to an oligonucleotide primer 
complementary to the sequence surrounding the codon to be altered, but 
comprising a codon (or an antisense codon complementary to such codon) 
specifying the new amino acid at the point where substitution is to be 
effected. If a deletion is desired, the primer will lack the 
particular codon specifying the amino acid to be deleted, while 
maintaining the correct reading frame. If an insertion is desired, 
the primer will include a new codon, at the appropriate location in 
the sequence, specifying the amino acid to be inserted. Preferably, 
the substitute codon, deleted codon, or inserted codon is located at 
or near the center of the oligonucleotide. 

The size of the oligonucleotide primer employed is determined 
by the need to optimize stable, unique hybridization at the mutation 
site with the 5' and 3' extensions being of sufficient length to avoid 
editing of the mutation by exonudeases . Thus, oligonucleotides used 
in accordance with the present invention will usually contain from 
about 15 to about 25 bases. Oligonucleotides of greater size are not 
needed . 

In a preferred approach (see Valder et al., supra), the 
resulting oligonudeotide/ss vector hybrid is directly transformed 
into yeast. Alternatively, a mutagenic primer is hybridized to a 
gapped duplex having a single-stranded template segment containing the 
gene to be altered. In the latter case, the primer is extended along 
the template strand by reaction with DNA polymerase I (Klenow 
fragment), T4 DNA polymerase, or other suitable DNA polymerase, 
providing a resulting double stranded DNA which is circularized and 
used to transfect a suitable host strain. In both cases, replication 
of the heteroduplex by the host provides progeny of both strands. In 
B. coli, transf acted cells are plated to provide colonies, which are 
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screened using a labelled oligonucleotide corresponding to that used 
in the mutagenesis procedure. If yeast are transformed directly, 
trans formants are pooled, DNA isolated and transformed into E. coli. 
The resulting colonies are screened by hybridization. Conditions are 
employed which result in preferential hybridization of the primer to 
the mutated I»iA but not to the progeny of the pacoit strand. DNA 
containing the mutated gene is then isolated and spliced into a 
suitable expression vector, and the vector used to transform a host 
strain. The host strain is then grovn in culture to provide the 
analog protein. 

Protein Expression in Recombinant Yeast Systems 
least. systems are preferred for expression of the recombinant 
analog proteins of this invention, e.g., the hG-CSFlLys^^l mutein. An 
exemplary expression vector is pBC102-K22 (ATCC 67,255) which contains 
MIA sequences from pBR322 for selection and replication in coli 
(Ap^ gene and origin of replication) and yeast DNA sequences including 
a glucose-repressible alcohol dehydrogenase 2 (ADH2) promoter. The 
ADH2 promoter has been described by Russell et al. , J. Biol. Chem. 
258:2674 (1982) and Beier et al.. Nature 300:724 (1982). Plasmid 
pBC102'K22 also includes a Trpl gene as a selectable marker and the 
yeast 2u origin of replication. Adjacent to the promoter is the yeast 
o-factor leader sequence enabling secretion of heterologous proteins 
from a yeast host. The ct-f actor leader sequence is modified to 
contain, near its 3' end, an Asp718 (Kpnl and Asp718 are 
isoschisomers) restriction site to facilitate fusion of this sequence 
to foreign genes, and the DYKDIM)DK sequence described above. To 
allow efficient processing of secreted protein, as described by Brake 
et al., Proc. Natl. Acad. Sci. USA 81:4642 (1984), a sequence coding 
for the Glu-Ala-Glu-Ala amino acids was omitted. Details regarding 
construction of this plasmid are provided below. 

Alternative expression vectors are yeast vectors which 
comprise an a-f actor promoter, for example pYoHuGH (ATCC 53157), which 
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bears the wild- type human GM-CSF gene. Others are known to those 
skilled in the art. The construction of pYoHuGM is described in 
EP*A-01 83350, the disclosure of which is incorporated by 
reference herein. 

The choice of appropriate yeast strains for transformation 
will be determined by the nature of the selectable markers and other 
features of the vector. Appropriate cerevisiae strains for 
transformation by pBC102-K22 or pYoHuGM, and various constructions 
derived from those vectors, include strains X2181-1B, available from 
the Yeast Genetic Stock Center, Berkeley, CA, USA [see belovj, having 
the genotype a tr£l gall adel his2; J17 (ATCC 52683; a his2 adel trpl 
tnetl4 ura3 ); and IL166-5B (ATCC 46183; a hisl trpl ). A particularly 
preferred expression strain, XV2181, is a diploid formed by mating two 
haploid strains, X2181-1B, available from the Yeast Genetic Stock 
Center, Department of Biophysics and Medical Physics, University of 
California, Berkeley, CA 94702, USA; and XV617-1-3B, available from 
the Department of Genetics, University of Washington, Seattle, WA 
98105, USA, or Immunex Corporation, 51 University Street, Seattle, WA 
98101, USA. A suitable transformation protocol is that described by 
Hinnen, et al., Proc. Natl. Acad. Sci. USA 75:1929 (1978), selecting 
for Trp* transformants in a selective medium consisting of 0.67Z yeast 
nitrogen base, 0.5% casaraino acids, ZX glucose, 10 yg/ml adenine and 
20 pg/ml uracil. 

Host strains comprising pBC102-K22 or other constructions 
comprising the ADH2 or a-factor promoters are grown for expression in 
a rich medium consisting of 1% yeast extract, 2Z peptone, and IX 
glucose supplemented with 80 yg/ml adenine and 80 pg/ml uracil. 
Derepression of the ADH2 promoter occurs upon exhaustion of medium 
glucose. 

Example 1: Purification of hG-CSF from HBT 5637 Conditioned Medium 
HBT 5637 adherent human bladder carcinoma cells (ATCC No. 
HTB9) were grown to confluence in RPHI-1640 medium supplemented with 
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lOX PCS. Once the cells were at confluence, Bediura from the cells vas 
decanted and replaced vlth fresh RFHI-1640 nediun containing Q.IX PCS. 
The cells were then cultured for approximately 72 hours in a 
humidified atmosphere of 5X COj in air. Thereafter, the medium vas 
harvested and centrifuged at 2000 Xg to remove floating or dying 
cells. 

HBT 5637 conditioned medium as prepared above vas initially 
concentrated by ammonium sulfate precipitation. The medium vas 
brought to 80% saturation by adding solid ammonium sulfate at A'^C vith 
gentle stirring. The ammonium sulfate vas added over an extended 
time period of approximately 12 hours. At 80Z saturation, gentle 
stirring vas continued for an additional 4 to 8 hours and the 
resulting mixture centrifuged at 10,000 Xg for 30 minutes. The final 
protein pellet vas then resuspended in PBS (phosphate buffered 
saline), pH 7.2, and dialyzed against multiple volumes of PBS, pB 7.2 
for 12 to 24 hours at 4**C. The dialyzed concentrate vas further 
purified by gel-filtration chromatography vith a 2.5 x 90 cm AcA-54 
ultragel column (an acrylamide-agarose ultragel having 100 to 10 It 
Dal ton fractionation, LKB, Bromma, Sveden). Prior to loading, the 
column was equilibrated vith PBS buffer. Fractions vere eluted from 
the column vith PBS containing 20 |ig/ml gentamycin and assayed for 
biological activity. Active fractions eluted from the column at an 
apparent molecular veight of 15,000-30,000 daltons. 

Active fractions from the gel chromatography step vere pooled 
and directly applied to a Waters Radiopak HPLC column (Waters 
Associates), 8 mm x 10 cm, packed with Vydac C4 resin, 15-20 micron 
particle size, and equilibrated with 0.1% TEA (v/v). Total protein 
applied was generally about 4-40 mg in a volume of about 20-300 ml. 
The loaded column was washed with O.IZ TFA (v/v) until absorbence at 
214 nm reached baseline values, then eluted with a linear gradient of 
0-lOOX acetonitrile in O.U TFA (v/v) at IX acetonitrile per minute. 
One minute fractions were collected and assayed for biological 
activity. Fractions containing activity were pooled, diluted with 10 
ml of buffer A (0.9 M acetic acid, 0.2 M, pyridine, pH 4.0) and 
reapplied to the Radiopak HPLC column, previously equilibrated with 
buffer A. The column was washed with buffer A to remove non-bound 
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components, then eluted with a 0-84Z gradient of buffer B (60% 
N-propanol In 0.9 M acetic acid, 0.2 M pyridine, pH 4.0). An initial 
0-20X buffer B gradient vas applied to the column over a period of 10 
minutes and then a second gradient of 20-842 buffer B was applied over 
110 minutes. The column vas pumped at a rate of 1 ml/min. during 
which time 1.5 minute fractions were collected and assayed. SDS-PAGE 
analysis of the active fractions Indicated a protein band of 
approximately 19,000 HV, 

Fractions containing activity were pooled, diluted with 10 ml 
of 0.1% (v/v) aqueous TFA, and applied to a Vydac CIS column (Waters 
Associates, 3.9 mm x 30 cm, 10 micron particle size) equilibrated with 
0.1% (v/v) aqueous TFA. The column vas eluted vith a linear gradient 
of 0-100% (v/v) acetonitrile in 0.1% TFA at a rate of 1% per minute 
(100 ml total). One minute fractions vere collected and assayed. 
Homogeneity of the protein component of the active fraction vas 
confirmed by SDS-PAGE and silver staining vhich yielded a single band 
at approximately 19,000 daltons. The specific activity of this 
homogeneous material vas determined to be approximately 10 U/yg by 
the 32D assay and 10^ CFU/yg by murine and human bone marrov colony 
assays . 

The protein sample purified by the foregoing HPLC steps vas 
sequenced using an automated amino terminal Edman degradation using an 
Applied Biosystem Model 470 Approaching Sequencer. In a first 
sequencing attempt using approximately 310 picomole of protein, 85% of 
the sequence could be assigned to a single protein. A second 
confirming run, using approximately 500 picomole of material, vas then 
made. The resulting N-terminal sequence vas Thr-Pro-Leu-Gly-Pro-Ala- 
Ser-Ser-Leu-Pro-Glu-Ser-Phe-His-Cys-Lys-Asn-Leu-Glu-Gln-Val-Arg-Ile- 

Lys-Ile. 
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Example 2: Isolation of cDNA encoding hG-CSF, Site-Specific 
Mutagenesis, and Yeast Expression of Active Mu tant Protein 

Synthetic oligonucleotides, designated A, B, and C, were 
constructed with sequences complementary to three selected sequences 
of the human G-CSP gene. Probe A, complementary to the sequence 
encoding part of the leader and first amino acid (Thr) of the mature 
G-CSF protein, had the sequence 5'-G6TGGCTTCCTGCACrGTCCA-3». Probe B, 
which corresponded to the region encoding amino acids 36-42 of the 
mature protein, had the sequence 5'-GIAGGrGGCACACICACTCAC-3' . Probe 
C, complementary to a 3' noncoding region of the native gene, had the 
sequence 5'-AACTCAGAAATGCAGGGAAGG-3' . The method of synthesis was a 
standard automated triester method substantially similar to that 
disclosed by Sood et al.. Nucleic Acids Res . 4:2557 (1977) and Hirose 
et al., Tet. Lett. 28:2449 (1978). Following synthesis, the 
oligonucleotide was deblocked and purified by preparative gel 
electrophoresis. The oligonucleotides were terminally radiolabelled 
using ^^P-ATP and T4 polynucleotide kinase by standard techniques, 
such as those disclosed by Maniatis et al., Molecular Cloning: A 
Laboratory Manual (Cold Spring Harbor Laboratory 1982), for use as 
screening probes. 

A cWIA library was constructed by reverse transcription of 
polyadenylated mBNA Isolated from total RNA extracted from a human 
bladder carcinoma cell line, HBT 5637 (ATCC BBT-9). The cDNA was 
rendered double-stranded using MIA polymerase I, blunt-ended with T4 
DNA polymerase, methylated with EcoRI methylase to protect EcoRI 
cleavage sites within the cMlA, and ligated to EcoRI linkers. The 
resulting constructs were digested with EcoRI to remove all but one 
copy of the linkers at each end of the cDNA, and ligated to EcoRI-cut 
and dephosphorylated arms of bacteriophage XgtlO (Huynh et al., DNA 
Cloning: A Practical Approach . Glover, ed. , IRL Press pp 49-78). The 
ligated DNA was packaged into phage particles to generate a library of 
750,000 recombinants. 150,000 recombinants were plated on E. coli 
strain C600hfl" and screened by standard plaque hybridization 
techniques with the labeled oligonucleotide probes. Two clones were 
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isolated from the library which hybridized to all three probes. These 
were plaque purified and used to prepare bacteriophage DNA which was 
digested with EcoRI. The digests were electrophoresed on an agarose 
gelf blotted onto nylon filters, and retested for hybridization. One 
clone was partially digested with EcoM followed by preparative 
agarose gel electrophoresis, then subcloned into an EcoRI-cut 
derivative of the standard cloning vector pBR322 containing a 
polylinker having a unique EcoRI site, a BamHl site and numerous other 
unique restriction sites. An exemplary vector of this type is 
described by Dente et al., Nucleic Acids Research 11:1645 (1983). 
Restriction mapping indicated the presence of restriction sites 
corresponding to those previously reported for human G-CSF. 

Oligonucleotides were synthesized which encoded the 
C-terminal 5 amino acids of the yeast a-factor leader peptide, 
beginning with the Kpnl site, and the H-terminal 46 amino acids of 
hG-CSF fused in-frame to the a-factor leader sequence. In FIG. 2, the 
synthetic N-terminal hC-CSF sequence is indicated as fragment A. This 
sequence included, at its C-terminal, an Aval site and a cohesive end 
corresponding to a PstI restriction site, in addition to added 
internal Hindlll and Xbal sites. This fragment lacked three codons 
corresponding to an internal Val-Ser-Glu sequence beginning at residue 
36, and thus converted the +3 version of hG-CSP to the more active -3 
version. See Nagata et al., Nature 319:415 (1986). The resulting 161 
base pair Kpnl-Pstl fragment, constructed from 8 oligomers of 
approximately 40 nucleotides each, was cloned into a pBR322 
derivative, pUC9, cut with Kpnl and Pstl. 

Following transformation of a suitable E. coli strain with 
each of the plasmids described above, DNA was purified using standard 
techniques and digested as follows. The pUC9 vector containing the 
synthetic N-terroinus (fragment A) was digested with Kpnl and Aval and 
a 145 base pair fragment recovered. The vector (pGEMBL in FIG. 2) 
containing the hG-CSF cDNA was first digested with StuI and then 
partially digested with Aval. An approximate 310 base pair Aval-StuI 
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fragment encoding the C*teraiinal amino acids 43-174 of hG-CSF was 
isolated from the partial digest. These fragments were ligated to a 
Kpnl and StuI cut yeast expression vector, pBClOf vhich comprises the 
ADH2 promoter, a-f actor leader sequence i yeast and E. coli origins of 
replication, yeast Trpl gene, and E. coli-Ap^ gene. The resulting 
construct vas designated pBC88. 

pBC88 vas digested with Asp718, vhich cleaves at Kpnl sites, 
and Apal, vhich cleaves at nucleotide 13 of the hG-CSF gene. The 
resulting large fragment vas then ligated to the folloving 
oligonucleotide (oligo B in FIG. 2), vhich includes a part of the 
C-terminus of the a-factor leader sequence terminating in a KEX2 
protease recognition site, an 8 codon sequence encoding an antigenic 
hydrophilic synthetic N-terminus (DYKDDDDK) capable of cleavage by 
bovine enterokinase, and a short sequence encoding the hG-CSF protein 
up to and including an Apal cohesive end. The sequence of 
oligonucleotide B, as veil as its corresponding amino acid sequence, 
are set forth belov: 

-12 -11 -10 -9 -8 -7 .6-5-4-3-2-11234 
Pro Cys Asp Lys Arg Asp Tyr Lys Asp Asp Asp Asp Lys Thr Pro Leu Gly 
GTA CCT TTG GAT AAA A6A GAC TAC AAG GAC GAC GAT GAC AAG ACT CCT CT6 GGC C 
GA AAC CTA TTT TCT CTG ATG TTC CTG CTG CIA CTG TTC TGA GGA GAC 

The resulting construction vas designated pBC90. 

In vitro mutagenesis vas conducted by a method similar to 
that described by Walder and Walder, Gene 42:133 (1986), but vith the 
folloving differences. The expression vector pBC90 vas modified to 
include a DM fragment encoding the origin of replication for the 
single stranded bacteriophage fl, derived from the pEMBL vector 
described by Dente et al., Nucleic Acids Research 11:1645 (1983). 
This vas achieved by cleaving pBC90 at a unique Nrul site, and 
ligating the cleaved plasmid to a blunt-ended 514 bp Rsal fragment 
comprising the fl origin. The resulting construct, designated pBC102, 
is capable of expressing G-CSF in yeast and also capable of generating 
single-stranded DNA vhen present in a suitable (male) strain of 
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E. coll and superinfected with helper phage. 

Single-stranded DNA was generated by transforming E. coli 
strain JH107 and superinfectlng vith helper phage IRl. 
Single-stranded DNA vas isolated and annealed to the mutagenic 
oligonucleotide 5'-GAACAAGTTAAAAAAATCCAGC-3S (oligo D) vhich provides 
a codon switch substituting Lys for Arg at a position corresponding to 
residue 22 of mature native hG-CSF. Annealing and yeast 
transformation conditions vere substantially similar to those 
disclosed by Valder and Valder, supra > Teast transformants vere 
selected by growth on medium lacking tryptophan, pooled, and DNA 
extracted substantially as described by Holm et al.. Gene 42:169 
(1986). This DNA, containing a mixture of wild- type and mutant 
plasmid DNA, was used to transform E. coli RRl to ampicillin 
resistance. The resulting colonies were screened by hybridization to 
radiolabeled oligonucleotide D, using standard techniques. Plasmids 
comprising DNA encoding hG-CSFlLys^^l were identified by the 
hybridization to radiolabeled oligonucleotide D under stringent 
conditions (e.g., 55"C, 6xSSC), and verified by nucleotide sequencing. 

Plasmid DNA from an E. coli clone containing the 
hG-CSF[Lys^^] gene was then isolated and used to transform yeast 
strain XV2181 for expression of the altered gene product. 
Supernatants produced by the transformed yeast strain grown under 
conditions promoting derepression of the ADH2 promoter were then 
assayed by reaction with a mouse monoclonal antibody followed by 
horseradish peroxidase-conjugated goat anti-mouse antibody to detect 
the presence of the DYKDDDDK N- terminal peptide, and by bioassay for 
G-CSF activity. These experiments indicated approximately 5-fold 
greater expression of the mutein relative to parallel expression 
experiments involving the native protein. 

Example 3; High Level Expression of hGSF in E. coli 
Plasmid pBCSS, assembled as described in Example 2, was 
digested with Apal and StuI to provide an approximate 427 base pair 
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fragment encoding the C-terminal amino acids 4-174 of aature hGCSF. 
This fragment is characterized by an "A/T enriched" N-terminal region, 
vherein the proportion of deoxy adenosine and deoxythymidine residues 
in the N-terminal 40 codons is increased relative to the native cDNA 
sequence. 

A similar Apal-StuI fragment was isolated from the cloning 
vector comprising the native hG-CSF sequence (pGEMBL in FIG. 2) to 
provide the region of the native cIWA sequence corresponding to the 
A/T-enriched version. 

The native and synthetic fragments were each ligated to the 
following oligonucleotide E, which provides cohesive Clal and Apal 
terminals and codons encoding an N-terminal methionine residue and the 
first three amino acids of mature hG-CSF: 

Clal Apal 
C GAT ACT ATG ACT CCT CTG GGC C 
TA TGA TAC TGA GGA GAC 

Met Thr Pro Leu Gly 

The resulting fragments were ligated to a Clal, Smal-cut 
derivative of pGEHl (Promega Biotec, Madison, Visconsin, USA) 
with a DNA fragment containing a temperature-sensitive X Pj^ promoter, 
This fragment is substantially similar to that described in detail by 
Course et al., Proc. Natl. Acad. Sci. USA 82:1069 (1985). A related X 
?^ promoter sequence is present on the plasmid resident in coll 
strain JMB9, which has been deposited with the American Type Culture 
Collection (ATCC) under accession number ATCC 37092, and on plasmid 
pPL28, resident in E. coli RRl and deposited as ATCC 53082. 

This expression vector, designated pPL3, was further modified 
by replacement of the commercial multiple cloning site with the 
following KpnI-Hindlll fragment: 
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Kpnl Clal 
GG GTA CCA AGT TCA CGT AAA AAG GGT ATC GAT ACT ATG GAC TAC AAA GAT 
CC CAT GGT TCA AGT GCA TTT TTC CCA TAG CTA TGA TAC CTG ATG TTT CTA 

EcoRV Ncol Smal 
GAG GAT ATC CAT GGA ATT CGA GCT CGC CCG GGG ATC CTC TAG AGT CGA CCT 
CTG CTA TAG GTA CCT TAA GCT CGA GC6 GGC CCC TAG GAG ATC TCA GCT GGA 

HindZII 
GCA GCC CAA GCT TGG 
CGT CGG GTT GGA ACC 

The three DNA fragments vere ligated with T4 DNA ligase to 
provide intact expression plasmids identical except for the 
replacement of the first 128 base pairs of mature h6-CSF by a 
synthetic DNA segmient. This segment has an A/T content enriched from 
362 to 57% as described above and illustrated in FIG. 1. The native 
construct vas designated pFL3/nG-CSF, while the A/T enriched version 
vas designated pFL3/sG-CSF. 

A third construction, designated pPL3/FLAG-sG-CSF, vas 
assembled from the 427 base pair Apal-StuI sG-CSF fragment fused to 
the following Clal-Apal oligonucleotide F. This fragment encodes an 
N-terminal methionine fused to a AspTyrLysAspAspAspAspLys (DYKDDDDK) 
hydrophobic leader sequence and the first three amino acids of mature 
hG-CSF. The DYKDDDDK sequence provides a highly antigenic leader 
capable of being bound by specific antibody and cleaved by purified 
bovine or porcine mucosal enterokinase, and is also capable of 
enhancing expression of fusion proteins. 

Clal Apal 
C GAT ACT ATG GAC TAC AAA GAT GAC GAT GAC AAA ACT CCT CTG GGC C 
TA TGA TAC CTG ATG TTT CTA CTG CTA CTG TTT TAC TGA GGA 

Met Asp Tyr Lys Asp Asp Asp Asp Lys Thr Pro Leu Gly 

Finally, a fourth construction, designated 
pPL3/F-stop-sG-CSF, was assembled from the 427 base pair Apal-StuI 
sG-CSF fragment fused to the following Clal-Apal oligonucleotide G, 
which encodes an N-terminal methionine fused to a AspTyrLysAspAspAsp 
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(DYKDDD) hydrophobic leader sequence which had previously been noted 
to provide enhanced expression of fusion proteins, a stop codon, a 
second N-terminal methionine, and the first three amino adds of 
mature hG-CSP: 

Clal *P*^ 
C GAT ACT ATG GAC TAG AAA GAT GAG GAT TAA CAT ATG ACT CCT CTG GGC C 
TA TGA TAG CTG ATG TTT CTA CTG CTA ATT 6IA TAC TGA GGA GAC 
Het Asp Tyr Lys Asp Asp Asp End Het Thr Pro Leu 

Each of the foregoing expression vectors was then used to 
transform E^ coll strain K802 (pRK248cIts; ATCC 33526) to tetracycline 
and ampicillin resistance. To test expression of proteins having 
G-CSF activity, the transformants were grown in L-broth without 
ampicillin at 30*C to an absorbance at 600 nm of about 0.4-0.5. The 
cultures were then shifted to 43"C for 1 hour to promote derepression 
of the X-P. promoter. After three hours, cells from 1 mL culture were 
collected by centrifugation and the pellet extracted in 300 pi 
guanidine-HCl. The resulting extract was diluted 1:100 in 
phosphate-buffered saline, and assayed in the 32D murine bone marrow 
proliferation assay. The results are set forth In the folloving 
table: 

32D Proliferation-Inducing 
Construct Activity (U/mL culture) 

pPL3 (no insert) 0 

nG-CSF If 200 

sG-CSF 130,000 

FstopG-CSF 70,000 

Flag/sG-CSF 1,600,000 

Example 4: Alteration or Addition of Cysteine Residues 
The native amino acid sequence of hG-CSF comprises five 
cysteine residues, located at positions 17, 36, 42, 64, and 74 of the 
mature hG-CSF molecule. In order to test whether alteration of the 
native sequence (Arg^^) by elimination of an odd-numbered cyteine 
would enhance expression by stabilizing the resulting 4-cysteine 
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versloni site-specific mutagenesis as previously described vas 
employed to convert the codon encoding Cys^^ from TGT to ACT, thus 
specifying serine in place of cysteine. The mutagenic 
oligonucleotide employed corresponded to the native sequence from 
5 nucleotide 39 through nucleotide 60, vith the exception of the 

substitution of deoxyadenosine for deoacythymidine at position 49. The 
resulting construct, assembled as a fusion protein having a DYKDDDDK 
N- terminal leader, vas reinserted into a yeast expression vector 
substantially similar to pBC102-K22 except for identity of the target 

10 sequence. This vector, designated pBC122, vas used to transform a 
suitable yeast host as previously described. The resulting 
transformants vere then grovn under conditions promoting derepression 
of the ADB2 promoter. Assay results are described belov. 

In a related experiment, Phe vas replaced by a cystein 

15 residue, to test the hypothesis that addition of a cysteine residue to 
provide a total of six cysteine residues vould also enhance expression 
and recovery by stabilizing the molecule. Again, site-specific 
mutagenesis vas employed to convert the codon encoding Phe^^ from TTC 
to TGC, thereby specifying cysteine. The mutagenic oligonucleotide 

20 employed corresponded to the native sequence from nucleotide 235 
through nucleotide 261, vith the exception of the substitution of 
deoxyguanosine for deoxythymidine at position 248. The resulting 
construct, incorporating a DYKDDDDK N-termlnal leader, vas reinserted 
into a yeast expression vector as described above. This vector, 

25 designated pBC121, vas used to transform a suitable host, vhich vas 
grovn under conditions promoting derepression of the A0H2 promoter. 
Assay results are indicated in the foUoving table. 

Teast culture supernatants from comparable fermenations vere 
assayed at appropriate dilutions using the 32D murine bone marrov 

30 proliferation assay. In the table, pBC90 refers to a vild-type 

protein incorporating the DYKODODK leader, and pBCllO indicates a 
22 

Lys derivate of pBC102 also incorporating a DYKDDDDK leader. 
Immuno-dot blot assays using monoclonal antibody specific for the 

35 
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DYKDDDDK leader indicated qualitatively similar levels of 
antibody-reactive protein in each culture sample Indicated belov: 



32D Proliferation-Inducing 
Construct Activity in 1 ml Culture Supernatant 

pBC90 439,667 
pBCllO 820,492 

pBC121 (6-Cys) 1,087,476 

pBC122 (4-Cys) 1,400,864 



Therapeutic Applications 
Hematopoietic growth factor (hG-CSF) may be employed to treat 
various stem and progenitor cell suppressions, such as those caused by 
chemotherapy and radiation therapy in cancer patients. Also, the 
factor may be used for the treatment of leukemia and various anemias. 
For such treatments, dosages in the range of 1 to 1 x 10 vg per 
patient per treatment may be used. Administration may be by any 
suitable method, e.g., by injection In a pharmacological carrier, such 
as saline or saline mixed vith human serum albumin. 
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CLAIMS FOR THE CONTRACTING STATES: BE, FR, DE, IT, LU, 
NL, SE, CH, LI and 6B. 

1 • Purified or substantially pure human granulocyte 
5 colony stimulating factor (hG-CSF). 

2m A human G-CSF (hG-CSF) routein wherein at least one 
yeast KEX2 protease processing site has been 
inactivated* 

3« An hG-CSF muteln having an amino acid sequence 
which is substantially homologous to the synthetic 
amino acid sequence of FIG 1 • 

15 4« An hG-CSF mutein wherein Cys^' of the native 
sequence has been replaced by a non-Cys residue. 

5. An hG-CSF mutein wherein Phe^^ of the native 
sequence has been replaced by a Cys residue. 

20 

6. A DNA segment encoding a human G-CSF protein, said 
segment characterized by increased deoxyadenosine and 
deoxythymidine content relative to the native hG-CSF 
sequence. 

25 

7. A DNA segment according to claim 6, having 
substantial homology to the synthetic DNA sequence of 
FIG 1. 

30 8. A DNA segment encoding encoding a protein 
according to any one of claims 2 to 5. 
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9. A crude culture supernatant comprising a protein 
according to any one of claims 1 to 3 at a protein 
concentration of at least 10 mcg/ml. 

g 10. A recombinant yeast expression vector comprising a 
DNA segment encoding hG-CSF or an hG-CSP mutein. 

11. A recombinant microbial expression vector 
comprising a DNA segment according to any one of claims 

10 « *° ®- 

12, A recombinant expression vector which is or is 
derived from pBC102.K22 (ATCC 67,255) • 

13* A process for preparing hG-CSF or an h-GSF mutein, 
comprising culturing a host microorganism comprising an 
expression vector according to any one of claims 10 to 
12 under conditions promoting expression. 

20 



25 



30 



31 



0243153 



CLAIMS FOR THE CONTRACTING STATES: AT and ES. 

1 . A process for the preparation of substantially 
pure or purified human granulocyte colony stimulating 

5 factor (h6-CSF), the process comprising applying a 
crude mixture containing h6-CSF to an HPLC column and 
eluting hG-CSF from the column. 

2. A process for the preparation of an hC-CSF mutein, 
^0 the process comprising coupling together by the 

formation of peptide bonds successive amino acid 
residues, wherein at least one amino acid residue at a 
yeast KEX2 processing site in the native sequence has 
been altered to render the site inactive. 

15 

3. A process for the preparation of an hG-CSF mutein 
having an amino acid sequence which is substantially 
homologous to the synthetic amino acid sequence shown 
in Figure 1 , the process comprising coupling together 

20 by the formation of peptide bonds successive amino acid 
residues. 

4. A process for the preparation of an hG-CSF mutein, 
the process comprising coupling together by the 

25 formation of peptide bonds successive amino acid 

residues, wherein Cys^^ of the native sequence has been 
replaced by a non-Cys amino acid residue. 

5. A process for the preparation of an hG-CSF mutein, 
30 the process comprising coupling together by the 

formation of peptide bonds successive amino acid 
residues, wherein Phe^^ of the native sequence has been 
replaced by a Cys residue. 
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6. A process for the preparation of a DNA sequence 
encoding a hximan G-CSF protein, the segment having 
Increased dA and dT content relative to the native 
hG-CSF sequence, the process comprising coupling 
together successive nucleotide residues. 

7. A process as claimed In claim 6, wherein the DNA 
sequence has substantial homology to the DNA sequence 
of Figure 1 • 

8. A process for the preparation of a DNA sequence 
encoding a protein preparable by a process as claimed 
In any one of claims 2 to 5, the DNA sequence 
preparation process comprising coupling together 
successive nucleotide residues • 

9. A process for the preparation of a crude culture 
supernatant comprising a protein preparable by a 
process as claimed In claim 1, 2 or 3, the supernatant 
preparation process comprising culturlng cells capable 
of expressing the protein Into the supernatant until 
the desired protein concentration in the supernatant is 
at least 10 mcg/ml. 

10. A process for the preparation of a recombinant 
yeast expression vector, the process comprising 
ligating yeast expression vector DNA and a DNA segment 
encoding hG-CSF or an hC-CSF muteln. 

11. A process for the preparation of a recombinant 
microbial expression vector, the process comprising 
ligating microbial expression vector DNA and a DNA 
segment preparable by a process as claimed in claim 6, 
7 or 8. 
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12. A process as claimed In claim 10, wherein the 
expression vector is, or is derived from, pBC102.K22 
(ATCC 67,255). 

13. A process for preparing hG-CSF or an h-GSF mutein, 
comprising culturlng a host microorganism comprising an 
expression vector produceable by a process according to 
any one of claims 10 to 12 under conditions promoting 
expression. 
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CLAIMS FOR THE CONTRACTING STATE: GR 

1 . A process for the preparation of substantially 
pure or purified human granulocyte colony stimulating 
factor (hG-CSF), the process comprising applying a 
crude mixture containing hG-CSF to an HPLC column and 
eluting hG-CSF from the column. 

2. A process for the preparation of an hG-CSF mutein, 
the process comprising coupling together by the 
formation of peptide bonds successive amino acid 
residues, wherein at least one amino acid residue at a 
yeast KEX2 processing site in the native sequence has 
been altered to render the site inactive. 

3. A process for the preparation of an hG-CSF mutein 
having an amino acid sequence which is substantially 
homologous to the synthetic amino acid sequence shown 
in Figure 1 , the process comprising coupling together 
by the formation of peptide bonds successive amino acid 
residues. 

4. A process for the preparation of an hG-CSF mutein, 
the process comprising coupling together by the 
formation of peptide bonds successive amino acid 
residues, wherein Cys''"' of the native sequence has been 
replaced by a non-Cys amino acid residue. 

5. A process for the preparation of an hG-CSF mutein, 
the process comprising coupling together by the 
formation of peptide bonds successive amino acid 
residues, wherein Phe^^ of the native sequence has been 
replaced by a Cys residue. 
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6. A DNA segment encoding a human G-CSF protein, said 
segment characterized by Increased deoxyadenosine and 
deoxythymldlne content relative to the native hG-CSF 
sequence • 

7. A DNA segment according to Claim 6, having 
substantial homology to the synthetic DNA sequence of 
FIG 1. 

8« A DNA segment encoding encoding a protein 
according to any one of claims 2 to 5. 

9* A crude culture supernatant comprising a protein 
according to any one of claims 1 to 3 at a protein 
concentration of at least 10 mcg/ml. 

10* A recombinant yeast expression vector comprising a 
DNA segment encoding hG-CSF or an hG-CSF muteln. 

11. A recombinant microbial expression vector 
comprising a DNA segment .according to any one of claims 
6 to 8. 

12. A recombinant expression vector which is or is 
derived from pBC102.K22 (ATCC 67,255) • 

13. A process for preparing hG-CSF or an h-GSF mutein, 
comprising culturing a host microorganism comprising an 
expression vector according to any one of claims 10 to 
12 under conditions promoting expression* 
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PIG. 1; COMPARISON OP hG-CSP SEQUENCES 

N - Native hG-CSP; S « Synthetic sequence. 

CAG CTG CTG CTG TGG CAC ACT GCA CTC TG6 ACA 6TG CAG^GAA 6CG 

ACT CCT CTG GGC CCT GCT TCT TCT CTG CCT CAA TCT TTT CTG CTC 45 
ACC CCC CTC GGC CCT GCC AGC TCC CTG CCC CAG AGC TTC CTG CTC 45 
Thr Pro Leu Gly Pro Ala Ser Ser Leu Pro Gin Ser Phe Uu Uu 15 



AAA TGT CTA GAA CAA GTT AAA AAA ATC CAG GGC GAT GGT GCA GCT 90 
AAG TGC TTA GAG CAA GTG AGG AAG ATC CAG GGC GAT GGC GCA GCG 90 
Lys Cys Leu Glu Gin Val Arg Lys He Gin Gly Asp Gly Ala Ala 30 
Ser 

CTT CAA GAA AAA CTG TGT GCT ACT TAT AAG CTT TGT CAT CCC GAG 135 
CTC CAG GAG AAG CTG TGT GCC ACC TAC AAG CTG TGC CAC CCC GAG 135 
Leu Gin Glu Lys Leu Cys Ala Thr Tyr Lys Leu Cys His Pro Glu 45 

GAG CTG GTG CTG CTC GGA CAC TCT CTG GGC ATC CCC TGG GCT CCC 180 
Glu Leu Val Leu Leu Gly His Ser Leu Gly He Pro Trp Ala Pro 60 

CTG AGC AGC TGC CCC AGC CAG GCC CTG CAG CTG GCA GGC TGC TTG 225 
Leu Ser Ser Cys Pro Ser Gin Ala Leu Gin Leu Ala Gly Cys Leu 75 

A TGC * 

AGC CAA CTC CAT AGC GGC CTT TTC CTC TAC CAG GGG CTC CTG CAG 270 
Ser Gin Uu His Ser Gly Leu Phe Uu Tyr Gin Gly Uu Uu Gin 90 

Cys 

GCT CTG GAA GGG ATA TCC CCT GAG CTC GGT CCT ACC TTG GAT ACA 315 
Ala Uu Glu Gly He Ser Pro Glu Uu Gly Pro Thr Uu Asp Thr 105 

CTG CAA CTG GAT GTT GCT GAT TTT GCT ACT ACT ATT TGG CAA CAA 360 
Uu Gin Leu Asp Val Ala Asp Phe Ala Thr Thr He Trp Gin Gin 120 

ATG GAA GAA CTG GGA ATG GCT CCT GCT CTG CAA CCT ACT CAA GGT 405 
Met Glu Glu Leu Gly Met Ala Pro Ala Uu Gin Pro Thr Gin Gly 135 

GCC ATG CCG GCT TTC GCG AGT GCT TTT CAA CGT CGT GCA GGA GGG 450 
Ala Met Pro Ala Phe Ala Ser Ala Phe Gin Arg Arg Ala Gly Gly 150 

GTC CTT GTT GCT TCT CAT CTG CAG AGT TTT CTT GAA GTT TCT TAT 495 
Val Uu Val Ala Ser His Leu Gin Ser Phe Uu Glu Val Ser Tyr 165 



CGT GTT CTA CGT CAT CTT GCT CAA CCA TGA AGG CCT CCA TGG GCA 540 
Arg Val Leu Arg Bis Leu Ala Gin Pro End < 174 
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